(54) Title: SYNTHESIS OF PROTEINS BY NATIVE CHEMICAL LIGATION



(57) Abstract 

Proteins of moderate size having native peptide backbones are produced by a method of native chemical ligation. Native chemical
ligation employs a chemoselective reaction of two unprotected peptide segments to produce a transient thioester-linked intermediate. The
transient thioester-linked intermediate then spontaneously undergoes a rearrangement to provide the full-length ligation product having a
native peptide bond at the ligation site. Full-length ligation products are chemically identical to proteins produced by cell-free synthesis.
Full-length ligation products may be refolded and/or oxidized, as allowed, to form native disulfide-containing protein molecules. The
technique of native chemical ligation is employable for chemically synthesizing full-length proteins.
SYNTHESIS OF PROTEINS BY NATIVE CHEMICAL LIGATION

Specification 

Field of Invention:

The invention relates to methods and intermediates
for chemically ligating two oligopeptides end to end with
an amide bond. More particularly, the invention relates
to methods and intermediates for chemically ligating
oligopeptides wherein an unoxidized N-terminal cysteine
of a first oligopeptide condenses with a C-terminal
thioester of a second oligopeptide to form a
β-aminothioester intermediate which spontaneously
rearranges intramolecularly to form an amide bond and the
ligation product.

Government Rights:

The invention disclosed herein was supported in
part by Grants Number R01 GM 48897-01, Number P01 GM
48870-03, and Number GM 50969-01 from the National
Institutes of Health. The United States government may
have certain rights to this invention.

Background:

Proteins may be synthesized chemically,
ribosomally in a cell-free system, or ribosomally within
a cell. Advances in each of these areas have
significantly improved access to many proteins but have
also stimulated demand for yet further improvements.

Proteins owe their diverse properties to the
precisely folded three-dimensional structures of their
polypeptide chains. The three-dimensional structure of
a protein determines its functional attributes. However,
at present, it is difficult to predict and/or fully
explain the biological properties of a protein from its
three-dimensional structure alone. A better
understanding of how structure determines the biological
properties of a protein can be achieved by systematically
varying the covalent structure of the molecule and
correlating the effects with the folded structure and
biological function. Accordingly, there is an increased
demand for enhanced synthetic techniques for synthesizing
new proteins and protein analogs.

Techniques derived from recombinant DNA-based
molecular biology can be employed to facilitate the
expression of proteins in genetically engineered
micro-organisms. The use of site-directed mutagenesis, as
disclosed by M. Smith (Angew. Chem. Int. Ed. Engl.
(1994): vol. 33, p 1214), enables the preparation of
large numbers of modified proteins in useful amounts for
systematic study, e.g., C. Eigenbrot and A. Kossiakoff,
Current Opinion in Biotechnology (1992): vol. 3, p 333.
The use of innovative approaches increases the range of
amino acids that can be incorporated in expression
systems and promises to significantly extend the utility
of biosynthetic modification of the covalent structure of
proteins. (C. J. Noren et al., Science (1989): vol. 244,
p 182; J. A. Ellman et al., Science (1992): vol.
255, p 197.) However, there appear to be limitations
inherent to the nature of ribosomal protein synthesis.
(V. W. Cornish, et al., Proc. Natl. Acad. Sci. USA
(1994): vol. 91, p 2910.)

Chemical synthesis of proteins has also
contributed to the exploration of the relationship of
protein structure to function. Stepwise solid phase
synthesis has permitted the de novo preparation of small
proteins. (T. W. Muir et al., Curr. Opin. Biotech.
(1993): vol. 4, p 420.) There are also several examples
of the use of stepwise solid phase synthesis of whole
proteins to explore the molecular basis of biological
function. (M. Miller, et al., Science (1989): vol. 246,
p 1149; A. Wlodawer, et al., Science (1989): vol. 245, p
616; L.H. Huang, et al., Biochemistry (1991): vol. 30, p
7402; and K. Rajarathnam, et al., Science (1994): vol.
264, p 90.)

Semi-synthesis through the conformationally-assisted
religation of peptide fragments can also be
employed, in special instances, to study the
structure/function relationship of proteins. (R. E.
Offord, "Chemical Approaches to Protein Engineering", in
Protein Design and the Development of New Therapeutics
and Vaccines, J. B. Hook, G. Poste, Eds., (Plenum Press,
New York, 1990) pp. 253-282; C. J. A. Wallace, et al., J.
Biol. Chem. (1992): vol. 267, p 3852.) An important
extension of the semisynthesis approach is the use of
enzymatic ligation of cloned or synthetic peptide
segments. (L. Abrahmsen, et al., Biochemistry (1991):
vol. 30, p 4151; T. K. Chang, et al., Proc. Natl. Acad.
Sci. USA (1994): in press.) Although the above
methodologies have been successfully applied to the
synthesis of proteins and protein analogs, T. W. Muir et
al. report that there is a continued interest in the
wider application of the tools of organic chemistry to
the study of proteins (Curr. Opin. Biotech. (1993): vol.
4, p 420).

Stephen Kent et al. recently introduced the
chemical ligation of unprotected peptide segments as an
improved route to the total synthesis of proteins. (M.
Schnolzer, et al., Science (1992): vol. 256, p 221.)
Chemical ligation involves the chemoselective reaction of
unprotected peptides to give a product with an unnatural
backbone structure at the ligation site. Use of
unprotected peptides circumvented the difficulties
inherent to classical chemical synthesis, viz. complex
combinations of protecting groups that lead to limited
solubility of many synthetic intermediates, e.g., K.
Akaji, et al., Chem. Pharm. Bull. (Tokyo) (1985): vol.
33, p 184. In contrast, the chemical ligation technique
has allowed us to make good use of the ability to
routinely make, purify, and characterize unprotected
peptides 50 or more residues in length. Using optimized
stepwise solid phase methods, the preparation in good
yield and high purity of peptides up to 60 residues is
routine. In favorable cases, peptides of 80+ residues
can be prepared. (M. Schnolzer, et al., Int. J. Pept.
Prot. Res. (1992): vol. 40, pp 180-193.)

The key aspect of the above approach to chemical
ligation is the use of a chemoselective reaction to
specifically and unambiguously join peptides by formation
of an unnatural (i.e., non-peptide) backbone structure at
the ligation site. It has permitted the facile
preparation of a wide range of backbone-modified
proteins, including analogues of protein domains, e.g.,
ligated 10F3, the integrin-binding module of fibronectin:
95 residues (M. Williams, et al., J. Am. Chem. Soc.
(1994): in press.) The catalytic contribution of
flap-substrate hydrogen bonds in HIV-1 protease has been
elucidated by the chemical synthesis of a homodimer of 99
residue subunits of this protein by chemical ligation.
(M. Baca, et al., Proc. Natl. Acad. Sci. USA (1993):
vol. 90, p 11638.) Chemical ligation has also proven to
be useful for the routine, reproducible synthesis of
large amounts of proteins in high purity with full
biological activity (20). (R. C. deLisle Milton, et al.,
"Synthesis of Proteins by Chemical Ligation of
Unprotected Peptide Segments: Mirror-Image Enzyme
Molecules, D- & L-HIV Protease Analogs," in Techniques in
Protein Chemistry IV, Academic Press, New York, pp.
257-267 (1992).)

Chemical ligation can also be employed for the
straightforward production of protein-like molecules of
unusual topology, e.g., four-helix bundle
template-assembled synthetic protein (MW 6647 Da) (P. E.
Dawson, et al., J. Am. Chem. Soc. (1993): vol. 115, p
7263); homogeneous multivalent artificial protein (MW
19,916 Da) (K. Rose, J. Am. Chem. Soc. (1994): vol. 116,
p 30); artificial neoprotein mimic of the cytoplasmic
domains of a multichain integrin receptor (MW 14,194 Da)
(T. W. Muir, et al., Biochemistry (1994): vol. 33, pp
7701-7708); and peptide dendrimer (MW 24,205 Da) (C. Rao,
et al., J. Am. Chem. Soc. (1994): vol. 116, p 6975). The
range of proteins accessible by this technique is limited
by the size of the synthetic peptide segments.

A useful extension would occur if one had direct
synthetic access to native backbone polypeptide chains up
to the size of typical protein domains. (A. L. Berman, et
al., Proc. Natl. Acad. Sci. USA (1994): vol. 91, p 4044.)
Chemical ligation would then be employed to string these
domains together to explore the world of proteins in a
general fashion.

A modular strategy for the total synthesis of
proteins, based on the convergent chemical ligation of
unprotected peptides, has been disclosed by L.E. Canne,
et al. (presented at the Annual
Meeting of the Protein Society, San Diego, July 1994).
Protein domains (modules) were prepared by chemical
ligation of 50-70 residue segments; these domains were
then stitched together to give the target protein.
Mutually compatible ligation chemistries are required:
intra-domain ligation should optimally yield a stable,
peptide-like bond; inter-domain ligation will tolerate a
wider variation of properties of the structure formed at
the ligation site.

Straightforward total chemical synthesis of
proteins represents the realization of an important
objective of organic chemistry. It raises the exciting
prospect of unrestricted variation of protein covalent
structure made possible by general synthetic access, and
will give new impetus to exploration of the structural
basis of properties such as folding, stability, catalytic
activity, binding, and biological action.

What is needed is a technique of native chemical
ligation which combines the formation of a native peptide
bond at the ligation site with the advantages of
chemoselective reaction of unprotected peptides. This
second generation ligation chemistry would significantly
increase the size of native backbone polypeptides
directly accessible by total chemical synthesis. It
could be usefully applied to a wide range of synthetic
targets, including proteins of moderate size, and it
would allow direct access to protein functional domains.
Native chemical ligation is a foundation stone of a
general modular approach to the total chemical synthesis
of proteins. Furthermore, it is compatible with the use
of both chemically synthesized peptides and peptide
segments derived from other sources.
Summary:

One aspect of the invention is directed to a
method of native chemical ligation. The method of native
chemical ligation facilitates the chemical synthesis of
proteins and large oligopeptides. The principle of
'native chemical ligation' is shown in Scheme 1. The
first step is the chemoselective reaction of an
unprotected synthetic peptide-α-thioester with another
unprotected peptide segment containing an N-terminal Cys
residue, to give a thioester-linked intermediate as the
initial covalent product. Without change in the reaction
conditions, this intermediate undergoes spontaneous,
rapid intramolecular reaction to form a native peptide
bond at the ligation site. The target full length
polypeptide product is obtained in the desired final form
without further manipulation. The general synthetic
access provided by the method of native chemical ligation
greatly expands the scope of variation of the covalent
structure of the protein molecule.

One embodiment of the invention provides a method
for ligating a first oligopeptide with a second
oligopeptide end to end for producing an oligopeptide
product. The first and second oligopeptides are admixed
in a reaction solution including a catalytic thiol. The
catalytic thiol may be an unconjugated mercaptan or a
conjugated thiol. Preferred catalytic thiols include
benzyl mercaptan, thiophenol, 1-thio-2-nitrophenol,
2-thio-benzoic acid, 2-thio-pyridine,
4-thio-2-pyridinecarboxylic acid, and 4-thio-2-nitro-pyridine.
The first oligopeptide includes a C-terminal thioester.
The second oligopeptide includes an N-terminal cysteine
having an unoxidized sulfhydryl side chain. The
unoxidized sulfhydryl side chain of the N-terminal
cysteine is then condensed with the C-terminal thioester
to produce an intermediate oligopeptide which links the
first and second oligopeptides with a β-aminothioester
bond. The β-aminothioester bond of the intermediate
oligopeptide then undergoes an intramolecular
rearrangement to produce the oligopeptide product which
links the first and second oligopeptides with an amide
bond.
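
The bookkeeping of this embodiment can be illustrated with a minimal Python sketch. It models only the two structural requirements stated above (a C-terminal thioester on the first segment and an N-terminal cysteine on the second) and the sequence of the final amide-linked product; it does not model the chemistry or the transient intermediate. The `Segment` class and the function name are hypothetical, introduced here for illustration, and the example sequences are the model peptides Leu-Tyr-Arg-Ala-Gly and Cys-Arg-Ala-Glu-Tyr-Ser used elsewhere in this specification.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """An unprotected peptide segment in one-letter amino acid codes."""
    sequence: str
    c_terminal_thioester: bool = False  # True if the segment ends in alpha-COSR


def native_chemical_ligation(first: Segment, second: Segment) -> str:
    """Return the sequence of the amide-linked ligation product.

    Illustrative bookkeeping only: the thioester-linked intermediate
    and its rearrangement are not represented.
    """
    if not first.c_terminal_thioester:
        raise ValueError("first segment needs a C-terminal thioester")
    if not second.sequence.startswith("C"):
        raise ValueError("second segment needs an N-terminal cysteine")
    return first.sequence + second.sequence


product = native_chemical_ligation(
    Segment("LYRAG", c_terminal_thioester=True),
    Segment("CRAEYS"),
)
print(product)  # LYRAGCRAEYS
```

The check on the first residue of the second segment reflects the requirement that ligation occurs at an N-terminal cysteine; a segment lacking it is rejected rather than silently joined.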

Another aspect of the invention is directed to an
oligopeptide intermediate which comprises a first
oligopeptide segment having a C-terminal thioester, a
second oligopeptide segment having an N-terminal cysteine,
and a β-aminothioester linkage unit which links the
C-terminal thioester and the N-terminal cysteine. The
β-aminothioester linkage unit spontaneously rearranges
intramolecularly to form an amide bond linking the first
and second oligopeptide segments end to end.

Another aspect of the invention is directed to a
method for producing an oligopeptide having a C-terminal
thioester. The method admixes a resin having a linker
with an unoxidized thiol with a Boc-amino acid
succinimide ester under reaction conditions to produce a
Boc-amino thioester-resin. An oligopeptide is then
assembled onto the Boc-amino thioester-resin by stepwise
solid phase peptide synthesis. When the oligopeptide is
complete, the Boc-amino thioester-resin is cleaved
with HF to produce an oligopeptide having a C-terminal
thiol. The C-terminal thiol is then converted to an
oligopeptide having a C-terminal thioester.

The oligopeptide thioester (α-COSR moiety) of
Scheme 1 can be readily generated from a corresponding
oligopeptide thiol (-α-COSH) prepared by highly optimized
stepwise SPPS on a thioester resin. The thioester resin
was prepared by the method of L.E. Canne et al.,
Tetrahedron Letters (1995): vol. 36, pp. 1217-1220,
incorporated herein by reference. The method of Canne
employs the thioester resin disclosed by Blake and
Yamashiro (J. Blake, Int. J. Pept. Protein Res. (1981):
vol. 17, p 273; D. Yamashiro, et al., Int. J. Pept.
Protein Res. (1988): vol. 31, p 322). Peptide products
were cleaved, purified, and characterized by conventional
methods. (M. Schnolzer, et al., Int. J. Pept. Protein
Res. (1992): vol. 40, pp 180-193.)

The Yamashiro methodology activates a thiocarboxyl
group on a protected oligopeptide with diaryl disulfides
to give acyl disulfides (Yamashiro et al., Int. J.
Peptide Protein Res. (1988): vol. 31, pp 322-334). These
C-terminal-peptide-acyl-disulfides are highly reactive
electrophilic intermediates which are attacked by, and
subsequently coupled with, an α-amino group on the
N-terminus of a second peptide to form native peptide
bonds. The reported coupling using 2,2'-dipyridyl
disulfide as the activator of the thiocarboxyl group
affords the desired α-IB-92 product in 45% yield. Overall
yields based on the starting resin for a 3-segment
synthesis of α-IB-92 are reported as 8%, while a
2-segment synthesis gave 11%.

Due to the high reactivity of the diaryl disulfide
bond, the Yamashiro approach requires extensive
protection and deprotection of amino acid residues
present in the peptide molecule. The lysine group, for
example, is protected as a citraconyl derivative because
of the reactive amine functionality. Additionally, an
Msc group or tBOC group is used to protect any terminal
amine functionalities present in the molecule.

The invention stated herein does not require the
use of any protecting groups for the coupling of two
oligopeptides because a less reactive (and thus more
chemoselective) thioester electrophile is used instead of
the acyl disulfide moiety (Yamashiro's approach). In the
intermolecular coupling step, this thioester electrophile
requires a more nucleophilic sulfhydryl moiety rather
than a free amine. The nucleophilic sulfhydryl moiety
can be found on cysteine residues. Since the amino and
hydroxyl functionalities are relatively unreactive to the
thioester electrophile, a selective coupling of the two
unprotected oligopeptides is achieved with the cysteine
sulfhydryl moiety. The sulfhydryl group on the cysteine
of peptide 2 will first attack the thioester of peptide
1 and form a coupled thioester intermediate. This
coupled thioester intermediate is concomitantly attacked
by the free α-amino moiety from the cysteine and
spontaneously rearranges to form the native peptide bond.
Yields are therefore increased by eliminating protection
and deprotection steps, since undesired side reactions
are reduced (Scheme 1).

The thioester moiety is prepared from a precursor
thioacid which is obtained by optimized stepwise
solid-phase peptide synthesis on an aminomethyl resin support
equipped with a thioester resin linker. The precursor
thioacid is subsequently generated in liquid HF at 0 °C
in 1 hour (Yamashiro et al., Int. J. Pept. Protein Res.
(1988): vol. 31, p 322).

A procedure for the synthesis of the thioester
linker with use of a stepwise solid phase peptide
synthesis has been reported by Blake (Blake et al., Proc.
Natl. Acad. Sci. USA (1981): vol. 78, p 4055) and Yamashiro
(Yamashiro et al., Int. J. Pept. Protein Res. (1988):
vol. 31, p 322). This method is undesirable, however,
because it requires the conversion of Boc-amino acid
succinimide esters to the corresponding Boc-amino
thioacids with hydrogen sulfide. An improved methodology,
reported herein, utilizes the Boc-amino acid succinimide
ester directly and therefore avoids the inconvenience and
hazards of hydrogen sulfide gas (Kent et al., Tetrahedron
Lett. (1995): vol. 36, p 1217).

In this method (Scheme 2), thiol 3 is generated
from the reaction of chloride 2 (Yamashiro et al., Int.
J. Pept. Protein Res. (1988): vol. 31, p 322) with
thiourea, followed by hydrolysis of the resulting
thiouronium salt in aqueous base. Thiol 3 is a general
intermediate which can be reacted with a wide range of
commercially available Boc-amino acid succinimide esters
to produce the desired thioester linker 1 which is
conveniently isolated as the dicyclohexylamine (DCHA)
salt.

Model studies were undertaken with small peptides
to investigate the native chemical ligation approach. To
help explore the mechanism of the reaction, the peptide
Leu-Tyr-Arg-Ala-Gly-α-COSBzl was reacted with Ac-Cys.
The exact mass of the resulting ligation product was
determined by electrospray mass spectrometry, and was
consistent with a thioester-linked peptide as the
ligation product generated by nucleophilic attack of the
Ac-Cys side chain on the α-thioester moiety of the
peptide. Reaction of Leu-Tyr-Arg-Ala-Gly-α-COSBzl with
H-Cys-Arg-Ala-Glu-Tyr-Ser (containing an unblocked α-NH2
functional group) proceeded rapidly at pH 6.8 (below pH
6 the reaction proceeded very slowly, suggesting the
involvement of the ionized thiolate form of the Cys side
chain), and gave a single product of the expected mass.
This product lacked susceptibility to nucleophiles, and
had the ability to form disulfide-linked dimeric
peptides, indicating unambiguously the formation of a
native amide bond at the ligation site. These studies
were consistent with the mechanism shown in Scheme 1, in
which the initial thioester ligation product was not
observed as a discrete intermediate because of the rapid
rearrangement to form a stable peptide bond. Facile
intramolecular reaction results from the favorable
geometric arrangement of the α-NH2 moiety with respect to
the thioester formed in the initial chemoselective
ligation reaction. Use of such 'entropy activation' for
peptide bond formation is based on principles enunciated
by Brenner. (M. Brenner, in Peptides. Proceedings of the
Eighth European Peptide Symposium, H. C. Beyerman, Eds.
(North Holland, Amsterdam, 1967) pp. 1-7.) The concept
of 'entropy activation' for peptide bond formation has
been more recently adopted by D. S. Kemp et al. (J. Org.
Chem. (1993): vol. 58, p 2216) and by C.-F. Liu, et al.
(J. Am. Chem. Soc. (1994): vol. 116, p 4149).
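
The pH dependence noted in the model studies (rapid reaction at pH 6.8, very slow reaction below pH 6) is consistent with the thiolate being the reactive species. The following Python sketch shows how the ionized fraction of a thiol varies with pH according to the Henderson-Hasselbalch relation. The pKa of 8.3 is an assumed, textbook-style value for a cysteine side chain and is not taken from this specification; the effective pKa of an N-terminal cysteine in a given peptide may differ, so the numbers are indicative only.

```python
def thiolate_fraction(ph: float, pka: float = 8.3) -> float:
    """Fraction of a thiol present as the ionized thiolate at a given pH.

    Henderson-Hasselbalch: fraction = 1 / (1 + 10**(pKa - pH)).
    The default pKa (8.3) is an assumption for illustration only.
    """
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


# Compare the thiolate fraction at the pH values discussed in the text.
for ph in (5.0, 6.0, 6.8, 7.5):
    print(f"pH {ph:.1f}: thiolate fraction = {thiolate_fraction(ph):.4f}")
```

Under this assumption, each unit drop in pH below the pKa reduces the thiolate fraction roughly tenfold, which is in line with the qualitative slowdown reported below pH 6.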

Several model peptides have been synthesized by
the method of native chemical ligation. The successful
synthesis of these model peptides establishes that native
chemical ligation is generally applicable to peptides
containing the full range of functional groups normally
found in proteins. Even free internal Cys residues may
be present in either of the reacting segments. Internal
Cys residues can undergo ester exchange with the
peptide-α-thioester component; however, this reaction is
unproductive because no rearrangement to the amide bond
can occur; the thioester formed is readily reversible and
remains a productive part of the reacting system. As
disclosed herein, native chemical ligation is limited to
reaction at an N-terminal Cys residue. It is important
to prevent the side chain thiol of this Cys from
oxidizing to form a disulfide-linked dimer, because this
is unreactive in the ligation. An excess of thiol
corresponding to the thioester leaving group was used to
keep the Cys residues in reduced form without interfering
with the ligation reaction. The amino-terminal peptide
segment must be prepared by chemical synthesis to equip
it with the necessary α-COSR functionality. Furthermore,
for optimal ligation, this component should have an
unhindered (i.e., non-β-branched) C-terminal amino acid.
Solubilizing agents such as urea or guanidine
hydrochloride did not interfere with the ligation and
could be used to enhance the concentration of peptide
segments, and thus increase the reaction rate.
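
The preference for an unhindered C-terminal residue on the thioester segment can be expressed as a simple screening check. The sketch below is illustrative only: the function name is hypothetical, and the set of β-branched residues (valine, isoleucine, threonine) is standard amino acid chemistry rather than a list given in this specification.

```python
# Beta-branched amino acids in one-letter code: Val, Ile, Thr.
BETA_BRANCHED = frozenset("VIT")


def has_unhindered_c_terminus(thioester_segment: str) -> bool:
    """Return True if the C-terminal residue is not beta-branched.

    `thioester_segment` is the sequence of the amino-terminal segment
    (the one carrying the alpha-COSR group), in one-letter codes.
    """
    if not thioester_segment:
        raise ValueError("segment sequence must not be empty")
    return thioester_segment[-1] not in BETA_BRANCHED


print(has_unhindered_c_terminus("LYRAG"))  # True: C-terminal Gly
print(has_unhindered_c_terminus("LYRAV"))  # False: C-terminal Val
```

A check of this kind could be used when choosing where to split a target sequence into segments, alongside the requirement that the following segment begin with Cys.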

Further model reactions demonstrate that the use
of better thioester leaving groups results in faster
ligation reactions. We applied this observation to the
native chemical ligation of peptides from the
extracellular domain of a human cytokine receptor (R.
D'Andrea, et al., Blood (1994): vol. 83, p 2802) as
shown in Scheme 5. Use of the 5-thio-2-nitrobenzoic acid
(-SNB) leaving group, corresponding to the reduced form
of Ellman's reagent, gave a rapid, high-yield reaction. As
described below in connection with Scheme 5, the reaction
between the peptide segments was observed to have gone
essentially to completion in less than 5 minutes, giving
the 50 residue product with a native peptide bond at the
site of ligation. Thus, rapid native chemical ligation
can be achieved by use of a thioester leaving group with
suitably tuned properties.

Application of the native chemical ligation method
to the total synthesis of a protein molecule was
illustrated by the preparation of human interleukin 8
(IL-8). (M. Baggiolini, et al., FEBS Lett. (1989): vol.
307, p 97; I. Clark-Lewis, et al., J. Biol. Chem. (1994):
vol. 269, p 16075; I. Clark-Lewis, Biochemistry
(1991): vol. 30, p 3128; and K. Rajarathnam, et al.,
Biochemistry (1994): vol. 29, p 1689.) The 72
amino acid polypeptide chain contains four Cys residues,
which form two functionally critical disulfide bridges in
the native protein molecule. The total synthesis of IL-8
is shown in Scheme 7. The two unprotected synthetic
peptide segments reacted cleanly to give the full length
polypeptide chain in reduced form without further
chemical manipulation (9). This successful ligation was
particularly significant because the 33- and 39-residue
IL-8 segments each contained two Cys residues, and
together encompassed 18 of the 20 genetically encoded
amino acids found in proteins. The purified product was
folded and oxidized as previously described, to give IL-8
with a mass precisely 4 daltons less than that of the
original ligation product, indicating the formation of
two disulfide bonds. The properties of this folded
product were identical to those of previously studied
authentic IL-8 samples. Titration in an assay for
neutrophil elastase release demonstrated that the
potencies (ED50 = 0.3 nM) and maximal responses of the
folded, ligated [Ala33]IL-8 and the corresponding
molecule obtained by conventional synthesis were
indistinguishable and identical to native sequence IL-8.
This result unambiguously confirmed the formation of a
peptide bond at the ligation site, because the
thioester-to-amide rearrangement must have taken place to give the
free Cys 34 side chain that formed the native disulfide
bond (see Scheme 7).
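
The inference from a 4 dalton mass decrease to two disulfide bonds follows from simple arithmetic: forming one disulfide from two cysteine thiols removes two hydrogen atoms. A short Python sketch of that calculation is given below. The function names are hypothetical, and the hydrogen mass used (1.008 Da, the average atomic mass) is a standard value rather than one quoted in this specification.

```python
HYDROGEN_MASS_DA = 1.008  # average atomic mass of hydrogen, in daltons


def mass_loss_on_oxidation(n_disulfides: int) -> float:
    """Mass lost (Da) when n disulfide bonds form: 2 H atoms per bond."""
    return 2 * HYDROGEN_MASS_DA * n_disulfides


def disulfides_from_mass_loss(delta_da: float) -> int:
    """Nearest whole number of disulfide bonds for an observed mass loss."""
    return round(delta_da / (2 * HYDROGEN_MASS_DA))


# The 33- and 39-residue segments give the 72-residue IL-8 chain.
assert 33 + 39 == 72

print(f"{mass_loss_on_oxidation(2):.3f}")  # 4.032
print(disulfides_from_mass_loss(4.0))      # 2
```

With four Cys residues in the chain, two disulfide bonds is also the maximum possible, so the observed mass change corresponds to complete oxidation.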

Proteins are usually studied by expression in
genetically engineered micro-organisms using the methods
of recombinant DNA-based molecular biology. Methods such
as site-directed mutagenesis have had a major impact on
the ability to prepare large numbers of modified proteins
in useful amounts for systematic study. Innovative
approaches have increased the range of amino acids that
can be incorporated in expression systems and promise to
significantly extend the utility of biosynthetic
modification of the covalent structure of proteins.
However, there appear to be limitations inherent to the
nature of ribosomal protein synthesis.

Wieland discloses a method for synthesizing
dipeptides using a thioester intermediate. (Wieland et
al., Liebigs Ann. Chem. (1953): vol. 580, p 159.) Wieland
utilizes the reaction of S-glycyl- (or other unbranched
aminoacyl-) thiophenols with cysteine. Thus, the
sulfhydryl group on the cysteine residue first attacks
the thioester of the S-glycyl-thiophenol and forms a
coupled thioester intermediate. This coupled thioester
intermediate is concomitantly attacked by the free
α-amino moiety from the cysteine and spontaneously
rearranges to form the native peptide bond.

A limitation of the Wieland approach is the size
of the molecules utilized (only mono-amino acids are
coupled to cysteine) and stems from the methodology used
for the synthesis of the thioester. To form the
thioester, Wieland's approach requires the activation of
the terminal carboxylic acid as a mixed anhydride, acid
chloride or thioacid. A problem arises if an acidic
moiety is present in an amino acid residue such as Asp (D)
and Glu (E). In these cases, the Wieland approach produces an
undesired side reaction and therefore requires a complex
protecting group strategy, particularly if oligopeptides
are synthesized.

The invention described herein eliminates the need
for an elaborate protecting group strategy since the
oligopeptide-thioester moiety is derived from a precursor
thioacid. This precursor thioacid (peptide-α-COSH) is
synthesized by a standard stepwise solid-phase peptide
synthesis on an aminomethyl resin support, equipped with
a thioester resin linker. The precursor thioacid is
cleaved from the linker/resin almost quantitatively (99%)
in liquid HF at 0 °C for 1 hour.

The thioester peptide (peptide-α-COSR) can be
synthesized in two general ways:

(1) Reaction of a crude lyophilized thioacid peptide
(peptide-α-COSH) with Ellman's reagent (5,5'-dithiobis-2-nitrobenzoic
acid, available from Aldrich company) at pH
5.5 (2.0 equivalents), 6 M guanidine in 100 mM Na acetate
buffer. This gives the SNB-thioester peptide
(peptide-α-COSNB) which is subsequently purified by reversed phase
high performance liquid chromatography (RPHPLC).

(2) Reaction of a crude lyophilized thioacid peptide
(peptide-α-COSH) with benzyl bromide at pH 4.0, 6 M
guanidine and 100 mM Na acetate buffer. The benzyl
thioester (peptide-α-COSBn) is then purified by RPHPLC.

The conditions stated above permit the formation of
an unprotected oligopeptide which is equipped with the
activated thioester. Subsequent reaction with a second
peptide containing a terminal cysteine residue permits
a facile coupling with the formation of a native peptide
bond and can generate oligopeptide chains of 100 or more
amino acid residues (Scheme 1).

In favorable cases, chemical synthesis has already
made important contributions to the exploration of the
relationship of protein structure to function. Stepwise
solid phase synthesis has permitted the de novo
preparation of small proteins (14) and there have been
several notable examples of the use of this method of
total protein synthesis to explore the molecular basis of
biological function. Another method that has in special
instances allowed chemistry to be applied to the study of
proteins is semi-synthesis through the conformationally-assisted
religation of peptide fragments. An important
extension of the semisynthesis approach is the use of
enzymatic ligation of cloned or synthetic peptide
segments. Although these methods currently have severe
limitations, there continues to be serious interest in
the wider application of the tools of organic chemistry
to the study of proteins.

Native chemical ligation provides precisely that
capability. It combines the formation of a native
peptide bond at the ligation site with the advantages of
chemoselective reaction of unprotected peptides. This
second generation ligation chemistry dramatically
increases the size of native backbone polypeptides
directly accessible by total chemical synthesis. It can
be usefully applied to a wide range of synthetic targets,
including proteins of moderate size, and it allows direct
access to protein functional domains. Native chemical
ligation is a foundation stone of a general modular
approach to the total chemical synthesis of proteins.
Furthermore, it is compatible with the use of both
chemically synthesized peptides and peptide segments
derived from other sources.

Straightforward total chemical synthesis of proteins
represents the realization of an important objective of
organic chemistry. It provides for unrestricted
variation of protein covalent structure made possible by
general synthetic access, and provides new impetus to
exploration of the structural basis of properties such as
folding, stability, catalytic activity, binding, and
biological action.

In an alternative embodiment, the carboxy-terminal
peptide segment or protein module can be expressed by
standard recDNA means; provided the product contained an
N-terminal Cys residue, it could be reacted with the
synthetic amino-terminal peptide-α-COSR using the native
chemical ligation described here to give a product in
which part of the protein had derived from chemical
synthesis and part from ribosomal synthesis.
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Detailed Description; 

Peptide-α-thioacid formation

A typical procedure for the formation and
utilization of the thioester resin linker for use in the
solid phase synthesis of peptide-α-thioacids is as
follows (Kent et al., Tetrahedron Lett. (1995): vol. 36,
p 1217).

4-(α-Mercaptobenzyl)phenoxyacetic acid, dicyclohexylamine
(3), Scheme 2. A mixture of 2, formed using the conditions
established by Yamashiro et al., Int. J. Pept. Protein Res.
(1988): vol. 31, pp 322-334 (7.5 g, 27 mmol), thiourea
(2.3 g, 30 mmol), and ethanol (100 mL) was heated to reflux
(conditions as reported by Koenig et al., J. Org. Chem.
(1958): vol. 23, pp 1525-1530). After 4 hours, conversion
to the thiouronium salt was essentially complete as shown
by TLC (90:5:5 chloroform:methanol:acetic acid). 10 N NaOH
(30 mL) was added and the reflux continued for 2-3 hours.
After cooling to room temperature, the reaction mixture was
concentrated in vacuo to approximately half the original
volume, acidified with concentrated HCl (to pH 2.0), and
extracted with ethyl acetate (4 x 30 mL). The combined
ethyl acetate extracts were washed with saturated NaCl
(1 x 30 mL) and dried over MgSO4. The volatile materials
were removed in vacuo. The resulting oil was dissolved
in ethyl acetate (100 mL) and any insoluble material
filtered. DCHA (dicyclohexylamine, available from
Aldrich company) (6.0 mL, 30 mmol) was added to the
filtrate with stirring. Within a few minutes, a white
solid began to precipitate. Diethyl ether (150 mL) was
added and the suspension cooled at -20 °C for several
hours. The resulting white solid was filtered, washed
with diethyl ether, and dried under vacuum to give 3
(10.3 g, 23 mmol, 84%): 1H NMR (CDCl3): δ 7.30 (m, 7H),
6.82 (d, 2H, J=8.7 Hz), 5.39 (br s, 1H), 4.40 (s, 2H),
2.81 (m, 2H), 2.23 (br s, 1H, ex D2O), 1.88-1.02 (comp m,
20H); FAB MS (cesium ion): calc for [C27H37NO3S, H+]
456.2572, found 456.2572. Anal. Calcd for C27H37NO3S:
C, 71.17; H, 8.18; N, 3.07; S, 7.04. Found: C, 71.11;
H, 8.41; N, 3.08; S, 7.09.
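
The calculated elemental analysis and the isolated yield quoted for compound 3 can be cross-checked from its molecular formula. The following is a minimal sketch and not part of the original procedure. It assumes the formula C27H37NO3S (the free acid C15H14O3S plus dicyclohexylamine C12H23N); the atomic weights and function names are illustrative choices.

```python
# Illustrative check of the "Anal. Calcd" values for compound 3.
# Assumption: molecular formula C27H37NO3S; atomic weights are standard
# average values and are not taken from the source text.
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.00794, "N": 14.0067, "O": 15.9994, "S": 32.065}
FORMULA_3 = {"C": 27, "H": 37, "N": 1, "O": 3, "S": 1}


def molar_mass(formula):
    """Average molar mass in g/mol for an {element: count} formula."""
    return sum(ATOMIC_WEIGHT[el] * n for el, n in formula.items())


def mass_percent(formula):
    """Mass percentage of each element in the formula."""
    total = molar_mass(formula)
    return {el: 100.0 * ATOMIC_WEIGHT[el] * n / total for el, n in formula.items()}


if __name__ == "__main__":
    for element, pct in mass_percent(FORMULA_3).items():
        print(f"{element}: {pct:.2f}%")
    # Isolated yield from the quoted masses: 10.3 g of 3 from 27 mmol of 2.
    mmol_product = 10.3 / molar_mass(FORMULA_3) * 1000.0
    print(f"yield: {100.0 * mmol_product / 27.0:.0f}%")
```

Comparing the printed percentages with the "Found" values gives a quick sense of how closely the isolated salt matches the assumed formula.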

General Synthesis of Boc-amino thioester linker (1),
dicyclohexylamine salt (Scheme 2). A mixture of 3 (3.67
mmol), Boc-Ala-OSu (available from Novabiochem Corp.)
(3.68 mmol), DIEA (diisopropylethylamine, 5.74 mmol),
dimethylformamide (35 mL) and methylene chloride (4 mL)
was stirred at room temperature. After several hours,
the initial white suspension completely dissolved to give
a clear, colorless solution. After 24 hours, the
reaction mixture was poured into 1 N HCl (150 mL) and
extracted with ethyl acetate (4 x 35 mL). The combined
ethyl acetate extracts were washed with 1 N HCl (2 x 30
mL), H2O (1 x 30 mL), saturated NaCl (1 x 30 mL) and dried
over MgSO4. Volatiles were removed in vacuo. The
resulting oil was purified by flash chromatography
(925:50:25 chloroform:MeOH:acetic acid) to give an oil
contaminated with acetic acid. To remove residual acetic
acid, the oil was dissolved in chloroform (40 mL) and
washed with 0.1 N HCl (7 x 10 mL), saturated NaCl (1 x 10
mL) and dried over MgSO4. Volatiles were removed in vacuo
to give 1 as an oil. This oil was dissolved in
diethyl ether (10 mL) to which was added dicyclohexylamine
(1 equivalent). Hexane (100 mL) was added with stirring
to separate the dicyclohexylamine salt of 1 as a thick
oil from any unreacted dicyclohexylamine. Solvents were
decanted from the oil and the oil dissolved in CH2Cl2
(30-40 mL). The resulting solution was concentrated in
vacuo to give the dicyclohexylamine salt of 1 as a white
foamy solid (Scheme 2).

An example of the linkage and synthesis on the resin
is as follows: 4-[α-(Boc-Ala-S)benzyl]phenoxyacetic acid
(0.80 mmol) is added in 9 mL methylene chloride to 1.00
g aminomethyl-resin (0.40 mmol) and treated at 0 °C with
1.33 mL 0.6 M DCCI (dicyclohexylcarbodiimide) in
methylene chloride for 15 min and at 24 °C for 30
minutes. The product is then subjected to standard
solid-phase peptide synthesis conditions (Kent et al.,
Tetrahedron Letters (1995): vol. 36, p 1217; J. Blake,
Int. J. Pept. Protein Res. (1981): vol. 17, p 273).
Once the desired chain is synthesized, the peptide resin
(approx. 45 µmol original load) is treated in 8 mL liquid
HF (0.8 mL anisole) at 0 °C for 1 hour. After
evaporation with nitrogen, the residue is washed with
ethyl acetate. The solid is subsequently stirred in
water (approx. 15 mL) at 0 °C while adjusting the pH to
6.0 with solid ammonium bicarbonate. Filtration and
lyophilization gives the crude thioacid product which can
be further purified by preparative HPLC in 30 mg batches.

Preparation of the thioester terminal peptide segment

The α-COSR thioester peptide can be synthesized in two
general ways:

(1) Reaction of a crude lyophilized thioacid peptide
with Ellman's reagent (5,5'-dithiobis-2-nitrobenzoic
acid, available from Aldrich company) (2.0 equivalents)
at pH 5.5 in 6 M guanidine, 100 mM Na acetate buffer.
This gives the SNB-thioester peptide which is
subsequently purified by reversed phase high performance
liquid chromatography (RPHPLC).

(2) Reaction of a crude lyophilized thioacid peptide
with benzyl bromide at pH 4.0 in 6 M guanidine, 100 mM Na
acetate buffer. The benzyl thioester is then purified
by RPHPLC.

The conditions stated above permit the formation of
an unprotected oligopeptide which is equipped with the
activated thioester. Subsequent reaction with a second
peptide containing an amino-terminal cysteine residue
permits a facile coupling with the formation of a native
peptide bond and can generate oligopeptide chains of 100
or more amino acid residues (Scheme 1).

Example 1

The model peptide Leu-Tyr-Arg-Ala-Gly-αCOSH
(Sequence No.: 1) is prepared by optimized stepwise
solid-phase peptide synthesis on an aminomethyl resin.
The thioester resin linker is prepared by a generalized
version as adopted from Kent et al., Tetrahedron Letters
(1995): vol. 36, p 1217; J. Blake, Int. J. Pept. Protein
Res. (1981): vol. 17, p 273; D. Yamashiro and C.H. Li,
ibid. (1988): vol. 31, p 322. Once the desired chain is
synthesized, the peptide resin (approx. 45 µmol original
load) is treated in 8 mL liquid HF (0.8 mL anisole) at 0
°C for 1 hour. After evaporation with nitrogen, the
residue is washed with ethyl acetate. The solid is
subsequently stirred in water (approx. 15 mL) at 0 °C
while adjusting the pH to 6.0 with solid ammonium
bicarbonate. Filtration and lyophilization gives the
crude thioacid product which can be further purified by
preparative HPLC in 30 mg batches.

The thioester terminal peptide segment is
subsequently prepared from the thioacid fragment by
chemical synthesis to equip it with the necessary α-COSR
functionality where R is an alkyl group such as benzyl,
5-thio-2-nitrobenzoic acid (-SNB), thiophenol, etc. The
use of better thioester leaving groups resulted in faster
ligation reactions. Thus, the model peptide Leu-Tyr-Arg-
Ala-Gly-αCOSH (Sequence No.: 1) is first converted to the
thiobenzyl ester by reaction with benzyl bromide (15
equivalents) in 6.0 M guanidine-HCl, pH 4.6, sodium
acetate buffer to form Leu-Tyr-Arg-Ala-Gly-αCOSBn
(Sequence No.: 3). The resulting peptide is purified
under standard reversed-phase high-performance liquid
chromatography (HPLC) conditions using approximately
20-45% acetonitrile at 1% per minute, monitored at 214 nm.

For the peptide H-Cys-Arg-Ala-Glu-Tyr-Ser (Sequence
No.: 2), solid phase methods allow the preparation of
peptides of up to 60 residues in good yield and high
purity as described in M. Schnolzer, P. Alewood, D.
Alewood, S.B.H. Kent, Int. J. Pept. Protein Res.
(1992): vol. 40, 180.

First, to explore the mechanism of the reaction, the
peptide Leu-Tyr-Arg-Ala-Gly-αCOSBn (Bn, benzyl; Sequence
No.: 3) was reacted with Ac-Cys (containing a blocked
α-NH2 functional group; commercially available from
Novabiochem Corp.). The exact mass of the resulting
ligation product, Leu-Tyr-Arg-Ala-Gly-αCOS-CH2C(NHAc)CO2H
(Sequence No.: 4), was determined by electrospray mass
spectrometry and was consistent with a thioester-linked
peptide as the ligation product generated by nucleophilic
attack of the Ac-Cys side chain on the α-thioester moiety
of the peptide.

Finally, the reaction of Leu-Tyr-Arg-Ala-Gly-αCOSBn
(Sequence No.: 3) with Cys-Arg-Ala-Glu-Tyr-Ser (Sequence
No.: 2, containing an unblocked α-NH2 functional group)
proceeded rapidly at pH 6.8 (below pH 6.0, the reaction
proceeded very slowly, suggesting the involvement of the
ionized thiolate of the Cys side chain at pH 6.8; Scheme
3) and gave a single product of the expected mass. The
peptides Leu-Tyr-Arg-Ala-Gly-αCOSBn and Cys-Arg-Ala-Glu-
Tyr-Ser were reacted in 0.1 M phosphate buffer at pH 6.8,
6.0 and 4.7 at 25 °C. After 1 hour, the reactions had
proceeded as follows: at pH 6.8, >95%; at pH 6.0,
approximately 10%; and at pH 4.7, approximately 1.0% of
ligated product Leu-Tyr-Arg-Ala-Gly-Cys-Arg-Ala-Glu-Tyr-
Ser (Sequence No.: 5). Scheme 3 shows the pH dependence
of the reaction, as observed by HPLC, after 1 hour at
25 °C. This product lacked susceptibility to
nucleophiles and had the ability to form disulfide-linked
dimeric peptides, indicating unambiguously the formation
of a native amide bond at the ligation site.
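
The pH dependence reported above is what one would expect if the thiolate form of the Cys side chain is the reactive species. As a rough illustration only, the sketch below uses the Henderson-Hasselbalch relation to estimate the thiolate fraction at each pH tested. The pKa of 8.3 is an assumed, typical textbook value for a cysteine thiol and is not a figure from this specification; the function name is hypothetical.

```python
# Rough estimate of the ionized (thiolate) fraction of a thiol versus pH.
# Assumption: thiol pKa = 8.3 (a typical value; not given in the source).
ASSUMED_THIOL_PKA = 8.3


def thiolate_fraction(ph, pka=ASSUMED_THIOL_PKA):
    """Fraction of thiol present as thiolate (Henderson-Hasselbalch)."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


if __name__ == "__main__":
    for ph in (4.7, 6.0, 6.8):
        print(f"pH {ph}: thiolate fraction = {thiolate_fraction(ph):.5f}")
```

Under this assumption the thiolate fraction rises by roughly two orders of magnitude between pH 4.7 and pH 6.8, which is qualitatively in line with the conversions of approximately 1%, approximately 10% and >95% observed after 1 hour.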

In another, unpublished model, the 2-thioacetic acid
derivative Leu-Tyr-Arg-Ala-Gly-SCH2COOH (Sequence No.: 6,
formed from attack of the thioacid Leu-Tyr-Arg-Ala-Gly-SH
(Sequence No.: 1) onto 2-bromoacetic acid in methylene
chloride) and Cys-Arg-Ala-Glu-Tyr-Ser (Sequence No.: 2)
were ligated at pH 6.8 in 0.2 M phosphate buffer at 45 °C.
After 1.0 hour the reaction had proceeded to 80%, as
observed by HPLC in Scheme 4. The isolation of oxidation
products from the ligated Leu-Tyr-Arg-Ala-Gly-Cys-Arg-Ala-
Glu-Tyr-Ser (Sequence No.: 5) and unreacted Cys-Arg-Ala-
Glu-Tyr-Ser demonstrated the presence of a free thiol in
the ligation product.

The native chemical ligation procedure is generally
applicable to peptides containing the full range of
functional groups normally found in proteins. Even free
internal Cys residues may be present in either of the
reacting segments. Internal Cys residues can undergo
ester exchange with the peptide-α-thioester component;
however, this reaction is unproductive because no
rearrangement to the amide bond can occur. The thioester
formed is readily reversible and remains a productive
part of the reacting system.

The native chemical ligation procedure is limited to
reaction at an amino-terminal Cys residue. To prevent
the side chain thiol of this Cys from oxidizing to form
a disulfide-linked dimer, an excess of thiol
corresponding to the thioester leaving group is used to
keep the Cys residues in reduced form without interfering
with the ligation reaction. In addition, small amounts
of low molecular weight thiols such as benzyl mercaptan
or thiophenol are added to the coupling reaction mixture
to maintain a reducing environment.
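
The sequence-level outcome of the ligation, and the requirement for an amino-terminal Cys, can be sketched as follows. This is an illustrative bookkeeping model only: the function name and the list-based representation are assumptions made here, and the code does not model the chemistry itself.

```python
# Sequence-level sketch of native chemical ligation (illustrative only).
# Peptides are lists of three-letter residue codes, N-terminus first.


def ligate(thioester_segment, cys_segment):
    """Join a C-terminal thioester segment to an N-terminal Cys segment.

    Raises ValueError if the second segment does not start with Cys,
    mirroring the requirement stated in the text.
    """
    if not cys_segment or cys_segment[0] != "Cys":
        raise ValueError("second segment must have an amino-terminal Cys")
    return list(thioester_segment) + list(cys_segment)


if __name__ == "__main__":
    seq_3 = ["Leu", "Tyr", "Arg", "Ala", "Gly"]          # Sequence No. 3
    seq_2 = ["Cys", "Arg", "Ala", "Glu", "Tyr", "Ser"]   # Sequence No. 2
    print("-".join(ligate(seq_3, seq_2)))                # Sequence No. 5
```

Passing a segment that begins with any other residue (for example Leu-enkephalin, whose amino-terminal residue is Tyr) raises an error, which is the in-code analogue of an unreactive control peptide.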
The addition of thiols increases the reactivity of
the thioester, particularly if the added thiol is a
better leaving group than the pre-formed thioester. An
example of this observation is when the benzyl thioester
is converted to a phenyl thioester by addition of
thiophenol to the reaction. Reaction yields and rates are
substantially increased. For example, after 7 hours with
benzyl mercaptan, the Barnase reaction yielded 25%, while
thiophenol treatment of the same reaction mixture yielded
90%.

Addition of thiols to the ligation mixture also keeps
the reaction mixture in a reduced form. This prevents
oxidation of the reactive N-terminal Cys residues and,
when internal Cys residues are present, the thiols reduce
the formation of intramolecular disulfide bonds.
Additionally, the reducing environment increases the
stability of the thioester segment (ligation reactions
can proceed overnight with little or no hydrolysis at pH
7.5).

Example 2 

A rapid native chemical ligation is illustrated by
the synthesis of a peptide segment corresponding to
residues 46 to 95 from the external domain of the human
IL-3 receptor β-subunit, incorporated herein: R. D'Andrea
et al., Blood (1994): vol. 83, p 2802.

Crude synthetic IL-3 Msc(46-76)αCOSH was converted
to the 5-thio-2-nitrobenzoic acid ester (-COSNB) by
treatment with 5,5'-dithio-bis(2-nitrobenzoic acid) [10
equivalents (eq)] in 8 M urea, pH 4.0, 50 mM ammonium
acetate buffer [the Msc, 2-(methylsulfonyl)ethyloxycarbonyl
(Fluka #69227), protecting group is placed on the N-
terminus using 1.1 equivalents (eq.) 2-(methylsulfonyl)ethyl
4-nitrophenyl carbonate, 1.1 eq. diisopropylethylamine, and
0.5 M dimethylformamide]. This thioester-containing
material was found to be completely stable below pH 6.0,
and was readily purified under standard reversed-phase
HPLC conditions using approximately 20-45% acetonitrile
at 1% per minute and monitored at 214 nm.

As shown in Scheme 5, ligation is initiated by adding
IL-3 [Cys77](77-95) (prepared by standard solid phase
methods, Kent et al., Int. J. Pept. Protein Res. (1992):
vol. 40, 180) to purified IL-3 Msc(46-76)αCOSNB at the
stated pH, and the reaction is monitored by UV [the
substituted aryl thiolate leaving group has a
characteristic UV absorption at 412 nm (ε at 412 nm =
13,700 dm^3 mol^-1 cm^-1)]. At pH 7.0, the reaction is
essentially complete within 5 min. No reaction is observed
when Msc(46-76)αCOSNB is exposed to a 10-fold molar excess
of Leu-enkephalin (amino-terminal residue, Tyr) at pH 5.0.
This control experiment confirms the absolute requirement
for an amino-terminal Cys residue at the site of ligation
(Scheme 5).

Purified IL-3 [Cys77](77-95) (0.98 mM) and IL-3 (46-
76)αCOSNB (0.9 mM) were reacted in 8 M urea, pH 5.0, 50
mM ammonium acetate buffer at 23 °C (monitored by
analytical HPLC; C18 reversed phase, 22.5 to 45%
acetonitrile at 0.7% per minute; 214 nm). After 1 hour,
the ligation solution is exposed to the reducing agent
tris(2-carboxyethyl)phosphine (TCEP) at pH 9.0 and
subsequently raised to pH 13.0 to remove the Msc
protecting group (treatment with TCEP was found to aid in
the purification and analysis of product by reducing the
thiophenol and benzyl mercaptan disulfide products, which
had a tendency to co-elute with the peptide products).
Scheme 6 shows the progress of the reaction by HPLC, from
the starting peptides to the crude product. The 50
residue product has the expected molecular mass by
electrospray mass spectrometry [observed, 5747.0 daltons;
calculated (average isotope composition), 5747.4 daltons].
The ligation product is shown to be stable at high pH and
under reducing conditions, and forms an intramolecular
disulfide bond. These observations are consistent with
the presence of a native peptide bond at the site of
ligation.

Analogous methods have required removal of protecting
groups (J. Blake et al., Int. J. Pept. Protein Res.
(1981): vol. 17, p 273; Kemp et al., J. Org. Chem.
(1993): vol. 58, p 2216; Liu et al., J. Am. Chem. Soc.
(1994): vol. 116, 4149) or conversion of intermediates to
the final form, or both steps. No previous method has
allowed the chemical reaction of unprotected peptide
segments to directly yield a native backbone final
product.



Example 3 

The IL-8 (34-72) segment (Sequence No.: 9) is prepared
by optimized stepwise solid-phase methods as described by
Kent et al., Int. J. Pept. Protein Res. (1992): vol. 40,
180, which yield peptides of 60-80 residues in good yields
and high purities. The peptide-αCOSH is prepared by
optimized stepwise solid-phase peptide synthesis on an
aminomethyl resin with a thioester linker. The thioester
linker is prepared by a generalized version as adopted
from J. Blake, Int. J. Pept. Protein Res. (1981): vol.
17, p 273; D. Yamashiro and C.H. Li, ibid. (1988): vol.
31, 322. Products are subsequently purified by standard
reversed-phase HPLC conditions and characterized by
standard methods which include electrospray mass
spectrometry.

Crude synthetic segment IL-8 (1-33)αCOSH (Sequence
No.: 7) is converted to the thiobenzyl ester by reaction
with benzyl bromide (15 equivalents) in 6 M guanidine-HCl
at pH 4.6 in 100 mM sodium acetate buffer. The reaction
mixture is purified under standard reversed-phase HPLC
conditions to give the thiobenzyl ester, IL-8 (1-
33)αCOSBn (Sequence No.: 8) (Scheme 7).

The segments IL-8 (1-33)αCOSBn (Sequence No.: 8) (5.0
mg, 1.3 µmol) and IL-8 (34-72) (Sequence No.: 9) (4.8 mg,
1.1 µmol) were reacted in 0.5 mL 6.0 M guanidine-HCl, pH
7.6, phosphate buffer at 23 °C in the presence of benzyl
mercaptan (5 µL). After suitable reaction time (48 to 72
hours), a ligation yield of approximately 60% was
obtained. The product was purified by standard reversed-
phase HPLC as described supra and characterized by
electrospray mass spectrometry.
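
For orientation, the quantities quoted for this ligation can be turned into approximate concentrations. The short sketch below is illustrative only. It takes the segment amounts as micromoles (consistent with milligram quantities of peptides of this size) and ignores any volume contributed by the added thiol; the function names are hypothetical.

```python
# Approximate segment concentrations for the IL-8 ligation (illustrative).
# Assumption: amounts are in micromoles and the volume is exactly 0.5 mL.


def concentration_mM(amount_umol, volume_mL):
    """Concentration in mM (umol per mL is numerically equal to mM)."""
    return amount_umol / volume_mL


def implied_molar_mass(mass_mg, amount_umol):
    """Molar mass in g/mol implied by a weighed mass and a molar amount."""
    return mass_mg / amount_umol * 1000.0


if __name__ == "__main__":
    segments = {"IL-8 (1-33) thioester": (5.0, 1.3), "IL-8 (34-72)": (4.8, 1.1)}
    for name, (mg, umol) in segments.items():
        print(name, f"{concentration_mM(umol, 0.5):.1f} mM,",
              f"~{implied_molar_mass(mg, umol):.0f} g/mol")
```

The implied molar masses of roughly 4 kDa per segment are what makes the micromole reading of the quoted amounts plausible.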

Scheme 8B shows an analytical HPLC chromatogram
(C18 reversed phase; 25 to 45% acetonitrile at 1% per
minute; monitored at 214 nm) before the reaction
of the synthetic peptide segments IL-8 (1-33)αCOSBn and
IL-8 (34-72).

Scheme 8C shows an analytical HPLC chromatogram
(C18 reversed phase; 25 to 45% acetonitrile at 1% per
minute; monitored at 214 nm) of the purified ligation
product, IL-8 (1-72)(SH)4 (Sequence No.: 10), in fully
reduced form. (Inset) Electrospray mass spectrum (raw
data displayed as a single charge state): observed
molecular mass 8319.8 daltons; calculated molecular mass
(average isotope composition), 8319.8 daltons.

As shown in Scheme 8D, air oxidation of the purified
1-72 ligation product forms the folded [Ala33]IL-8
molecule, shown after HPLC purification. The earlier
elution of the folded, disulfide cross-linked native
protein compared with the reduced polypeptide is typical
(see Lewis et al., FEBS Lett. (1989): vol. 307, p 97;
Lewis et al., J. Biol. Chem.: vol. 269, p 16075; Lewis et
al., Biochemistry (1991): vol. 30, 3128).

Folding and oxidation conditions: polypeptide at ~0.2
mg/mL, 1 M guanidine-HCl, pH 8.5 Tris buffer, and vigorous
stirring in air at ambient temperature. (Inset)
Electrospray mass spectrometry of the oxidized and folded
synthetic IL-8 (raw data displayed as a single charge
state): observed molecular mass, 8315.6 daltons;
calculated molecular mass (average isotope composition),
8315.8 daltons (Scheme 8).
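
The 4 dalton difference between the calculated reduced and oxidized masses follows from the loss of two hydrogen atoms per disulfide bond formed. A minimal sketch of that arithmetic is given below. The function name and the hydrogen atomic weight are illustrative assumptions, and the count of two disulfides is inferred from the four Cys residues of IL-8 (1-72)(SH)4.

```python
# Expected average mass after disulfide formation (illustrative sketch).
# Each disulfide bond removes two hydrogen atoms from the reduced chain.
HYDROGEN_MASS = 1.00794  # average atomic weight of H, in daltons


def oxidized_mass(reduced_mass, n_disulfides):
    """Average mass after forming n_disulfides intramolecular disulfides."""
    return reduced_mass - 2 * n_disulfides * HYDROGEN_MASS


if __name__ == "__main__":
    reduced = 8319.8  # calculated mass of reduced IL-8 (1-72), from the text
    print(f"{oxidized_mass(reduced, 2):.1f}")
```

The same function can be applied to any reduced mass in the text by changing the number of disulfides assumed.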



Example 4: HIV-1 K41 protease (unpublished conditions)

Ligation reactions are performed in several ways.
An optimized procedure for a ligation reaction involving
a 5-thio-2-nitrobenzoic acid (SNB) thioester is to weigh
the two peptides, HIV (1-40)-COSNB (Sequence No.: 11,
formed from standard conditions stated herein) and HIV
(41-99) (Sequence No.: 12, formed from standard
conditions stated herein), as solids in the same reaction
vessel and add 6.0 M guanidine-HCl, pH 6.5, with 100 mM Na
acetate (the approximate peptide concentration is 7-13
mg/mL of each peptide).

After 5 min, approx. 2.0% thiol is added. Two
thiol catalysts have been used, viz. benzyl mercaptan
(forms the benzyl thioester in situ; Sequence No.: 13) and
thiophenol (forms the phenyl thioester in situ; Sequence
No.: 14). In the ligation of HIV PR, reaction with benzyl
mercaptan gave greater than 60% product yield in 40 hours
while the thiophenol gave greater than 80% product yield
in 10 hours to form HIV-1 K41 protease (Sequence No.:
15).

Subsequent treatment with TCEP was found to aid in
the purification and analysis of product by reducing the
thiophenol and benzyl mercaptan disulfide products which
tend to co-elute with peptide products. The product was
purified by standard reversed-phase HPLC as described
supra and characterized by electrospray mass spectrometry
(Scheme 9).

Example 5: Barnase (unpublished conditions)

The two peptides Barnase (1-48)-SNB (Sequence No.:
16, formed from standard conditions stated herein) and
Barnase (49-110) (Sequence No.: 17, formed from standard
conditions stated herein) were weighed as solids in the
same reaction vessel and dissolved in pH 7.5 buffer (6 M
guanidine, 100 mM phosphate). Immediately upon dissolving
the peptides, 2% benzyl mercaptan (forms the benzyl
thioester in situ; Sequence No.: 18) or 4% thiophenol
(forms the phenyl thioester in situ; Sequence No.: 19) was
added. After 7 hours, the benzyl mercaptan reaction
had proceeded to 25% and the thiophenol reaction had
proceeded to >90% to form Barnase (1-110) (Sequence No.:
20). The product was purified by standard reversed-phase
HPLC as described supra (Scheme 10).
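
One way to compare the two thiol additives is to convert the 7 hour conversions into apparent first-order rate constants. This is a simplifying assumption introduced here for illustration (the specification reports only the conversions, not a kinetic model), and the function name is hypothetical.

```python
import math

# Apparent first-order rate constant from a single conversion measurement.
# Assumption: conversion follows x(t) = 1 - exp(-k t); this model is not
# stated in the source and is used only for a rough comparison.


def apparent_rate_constant(conversion, hours):
    """Return k in 1/h for the fractional conversion reached after `hours`."""
    if not 0.0 <= conversion < 1.0:
        raise ValueError("conversion must be in [0, 1)")
    return -math.log(1.0 - conversion) / hours


if __name__ == "__main__":
    k_benzyl = apparent_rate_constant(0.25, 7.0)   # benzyl mercaptan
    k_phenyl = apparent_rate_constant(0.90, 7.0)   # thiophenol (">90%")
    print(f"k(benzyl mercaptan) = {k_benzyl:.3f} per hour")
    print(f"k(thiophenol)       = {k_phenyl:.3f} per hour")
    print(f"ratio = {k_phenyl / k_benzyl:.1f}")
```

Because the thiophenol conversion is reported as a lower bound, the ratio obtained this way should be read as a lower bound as well.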

The addition of thiols increases the reactivity of
the thioester, particularly if the added thiol is a
better leaving group than the pre-formed thioester. An
example of this observation is when the benzyl thioester
is converted to a phenyl thioester by addition of
thiophenol to the reaction. Reaction yields and rates are
substantially increased. For example, after 7 hours with
benzyl mercaptan, the Barnase reaction yielded 25%, while
thiophenol treatment of the same reaction mixture yielded
90% to form Barnase (1-110) (Sequence No.: 20).

Addition of thiols to the ligation mixture also keeps
the reaction mixture in a reduced form. This prevents
oxidation of the reactive N-terminal Cys residues and,
when internal Cys residues are present, the thiols reduce
the formation of intramolecular disulfide bonds.
Additionally, the reducing environment increases the
stability of the thioester segment (ligation reactions
can proceed overnight with little or no hydrolysis at pH
7.5).

WO 96/34878 PCT/US95/05668

[Scheme 9: Mutant HIV-1 K41 protease synthesized by native
chemical ligation. Segment 1-40 (NH2 ... COSR) and segment
41-99 (Cys ... COOH) give the ligated 1-99 product
(NH2 ... Cys ... COOH).]

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: The Scripps Research Institute 

(B) STREET: 10666 North Torrey Pines Road, Suite 220, 

Mail Drop TPC8 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 92037 

(G) TELEPHONE: 619-554-2937 

(H) TELEFAX: 619-554-6312 



(ii) TITLE OF INVENTION: SYNTHESIS OF PROTEINS BY NATIVE CHEMICAL 
LIGATION 

(iii) NUMBER OF SEQUENCES: 20 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 (EPO)

(v) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: PCT/US 
(B) FILING DATE: 04-MAY-1995



(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = COSH 
/note= "Wherein COSH is thioacid." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

Leu Tyr Arg Ala Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Cys Arg Ala Glu Tyr Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = COSBn

/note= "Wherein COSBn is benzyl thioester." 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Leu Tyr Arg Ala Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = X 

/note= "Wherein X is N-acetyl-cysteine-thioester." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Leu Tyr Arg Ala Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Leu Tyr Arg Ala Gly Cys Arg Ala Glu Tyr Ser 
1 5 10

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label = SCH2COOH 

/note= "Wherein SCH2COOH is 2-thioacetic acid."

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Leu Tyr Arg Ala Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 33 

(D) OTHER INFORMATION: /label = COSH 
/note= "Wherein COSH is thioacid." 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label = Msc 
/note= "Wherein Msc is 
2-methyl-sulfonyl-ethyloxy-carbonyl."

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

Ser Ala Lys Glu Leu Arg Cys Gln Cys Ile Lys Thr Tyr Ser Lys Pro
1 5 10 15

Phe His Pro Lys Phe Ile Lys Glu Leu Arg Val Ile Glu Ser Gly Pro
20 25 30

Ala



(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 33 

(D) OTHER INFORMATION: /label = COSBn 

/note= "Wherein COSBn is benzyl thioester." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Ser Ala Lys Glu Leu Arg Cys Gln Cys Ile Lys Thr Tyr Ser Lys Pro
1 5 10 15

Phe His Pro Lys Phe Ile Lys Glu Leu Arg Val Ile Glu Ser Gly Pro
20 25 30 



Ala 



(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Cys Ala Asn Thr Glu Ile Ile Val Lys Leu Ser Asp Gly Arg Glu Leu
1 5 10 15

Cys Leu Asp Pro Lys Glu Asn Trp Val Gln Arg Val Val Glu Lys Phe
20 25 30 

Leu Lys Arg Ala Glu Asn Ser 
35 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 72 

(D) OTHER INFORMATION: /label = SH4



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ser Ala Lys Glu Leu Arg Cys Gln Cys Ile Lys Thr Tyr Ser Lys Pro
1 5 10 15

Phe His Pro Lys Phe Ile Lys Glu Leu Arg Val Ile Glu Ser Gly Pro
20 25 30

Ala Cys Ala Asn Thr Glu Ile Ile Val Lys Leu Ser Asp Gly Arg Glu
35 40 45

Leu Cys Leu Asp Pro Lys Glu Asn Trp Val Gln Arg Val Val Glu Lys
50 55 60

Phe Leu Lys Arg Ala Glu Asn Ser 
65 70 

(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 40 

(D) OTHER INFORMATION: /label = COSNB 

/note = "Wherein COSNB is 5-thio-2-nitro-benzoic 
acid ester." 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 27 

(D) OTHER INFORMATION: /label = Xaa

/note= "Wherein Xaa is 2-Aminobutyric acid." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

Pro Gln Ile Thr Leu Trp Lys Arg Pro Leu Val Thr Ile Arg Ile Gly
1 5 10 15

Gly Gln Leu Lys Glu Ala Leu Leu Asp Thr Gly Ala Asp Asp Thr Val
20 25 30

Ile Glu Glu Met Asn Leu Pro Gly
35 40 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 59 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 27 

(D) OTHER INFORMATION: /label = Xaa 

/note= "Wherein Xaa is 2-Aminobutyric acid." 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 55 

(D) OTHER INFORMATION: /label = Xaa 

/note= "Wherein Xaa is 2-Aminobutyric acid." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Cys Trp Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val
1 5 10 15

Arg Gln Tyr Asp Gln Ile Pro Val Glu Ile Xaa Gly His Lys Ala Ile
20 25 30

Gly Thr Val Leu Val Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn
35 40 45

Leu Leu Thr Gln Ile Gly Xaa Thr Leu Asn Phe
50 55

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide

(ix) FEATURE:

(A) NAME/KEY: Modified-site 

(B) LOCATION: 40 

(D) OTHER INFORMATION: /label = COSBn 
/note= "Wherein COSBn is ??." 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 40 

(D) OTHER INFORMATION: /label = COSBn 

/note= "Wherein COSBn is benzyl thio ester." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Pro Gln Ile Thr Leu Trp Lys Arg Pro Leu Val Thr Ile Arg Ile Gly
1 5 10 15

Gly Gln Leu Lys Glu Ala Leu Leu Asp Thr Gly Ala Asp Asp Thr Val
20 25 30

Ile Glu Glu Met Asn Leu Pro Gly
35 40 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 40 

(D) OTHER INFORMATION: /label = COSPh 

/note= "Wherein COSPh is phenyl thioester." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
Pro Gln Ile Thr Leu Trp Lys Arg Pro Leu Val Thr Ile Arg Ile Gly
1 5 10 15

Gly Gln Leu Lys Glu Ala Leu Leu Asp Thr Gly Ala Asp Asp Thr Val
20 25



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 67 

(D) OTHER INFORMATION: /label = Xaa 

/note= "Wherein Xaa is amino butyric acid." 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 95 

(D) OTHER INFORMATION: /label = Xaa 

/note= "Wherein Xaa is 2-Aminobutyric acid." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Pro Gln Ile Thr Leu Trp Lys Arg Pro Leu Val Thr Ile Arg Ile Gly
1 5 10 15

Gly Gln Leu Lys Glu Ala Leu Leu Asp Thr Gly Ala Asp Asp Thr Val
20 25 30

Ile Glu Glu Met Asn Leu Pro Gly Cys Trp Lys Pro Lys Met Ile Gly
35 40 45

Gly Ile Gly Gly Phe Ile Lys Val Arg Gln Tyr Asp Gln Ile Pro Val
50 55 60

Glu Ile Xaa Gly His Lys Ala Ile Gly Thr Val Leu Val Gly Pro Thr
65 70 75 80

Pro Val Asn Ile Ile Gly Arg Asn Leu Leu Thr Gln Ile Gly Xaa Thr
85 90 95 

Leu Asn Phe 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 48 

(D) OTHER INFORMATION: /label = COSNB 

/note = "Wherein COSNB is 5-thio-2-nitro benzoic 
acid ester." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr Leu Gln Thr
1 5 10 15

Tyr His Lys Leu Pro Asn Asp Tyr Ile Thr Lys Ser Glu Ala Gln Ala
20 25 30 

Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val Ala Pro Gly 
35 40 45 



(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Cys Ser Ile Gly Gly Asp Ile Phe Ser Asn Arg Glu Gly Lys Leu Pro
1 5 10 15

Gly Lys Ser Gly Arg Thr Trp Arg Glu Ala Asp Ile Asn Tyr Thr Ser
20 25 30

Gly Phe Arg Asn Ser Asp Arg Ile Leu Tyr Ser Ser Asp Trp Leu Ile
35 40 45

Tyr Lys Thr Thr Asp His Tyr Gln Thr Phe Thr Lys Ile Arg
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 48 

(D) OTHER INFORMATION: /label = COSBn 

/note= "Wherein COSBn is benzyl thio ester."



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:

Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr Leu Gln Thr
1 5 10 15

Tyr His Lys Leu Pro Asn Asp Tyr Ile Thr Lys Ser Glu Ala Gln Ala
20 25 30

Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val Ala Pro Gly
35 40 45



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 48 

(D) OTHER INFORMATION: /label = COSPh 

/note= "Wherein COSPh is phenyl thio ester." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr Leu Gln Thr
1 5 10 15

Tyr His Lys Leu Pro Asn Asp Tyr Ile Thr Lys Ser Glu Ala Gln Ala
20 25 30 

Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val Ala Pro Gly 
35 40 45 



(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:

Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr Leu Gln Thr
1 5 10 15

Tyr His Lys Leu Pro Asn Asp Tyr Ile Thr Lys Ser Glu Ala Gln Ala
20 25 30

Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val Ala Pro Gly
35 40 45

Cys Ser Ile Gly Gly Asp Ile Phe Ser Asn Arg Glu Gly Lys Leu Pro
50 55 60

Gly Lys Ser Gly Arg Thr Trp Arg Glu Ala Asp Ile Asn Tyr Thr Ser
65 70 75 80

Gly Phe Arg Asn Ser Asp Arg Ile Leu Tyr Ser Ser Asp Trp Leu Ile
85 90 95

Tyr Lys Thr Thr Asp His Tyr Gln Thr Phe Thr Lys Ile Arg
100 105 110
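
The listing suggests that SEQ ID NO:15 is SEQ ID NO:13 followed by SEQ ID NO:12, and that SEQ ID NO:20 is SEQ ID NO:18 followed by SEQ ID NO:17. The following is a minimal Python sketch of that bookkeeping. It models only the sequence outcome of joining a C-terminal thioester segment to a segment bearing an N-terminal Cys, not the chemistry itself. The helper names `parse` and `ligate` are hypothetical, and the residue codes are transcribed from the listing with the three-letter codes normalized (Ile, Gln, Glu), which is an assumption about the intended text.

```python
# Illustrative bookkeeping only: models the sequence outcome of a ligation,
# not the chemistry. The function names are hypothetical.

def parse(listing):
    """Split a whitespace-separated run of three-letter residue codes."""
    return listing.split()


def ligate(thioester_segment, cys_segment):
    """Join two segments end to end; the second must start with Cys."""
    if not cys_segment or cys_segment[0] != "Cys":
        raise ValueError("second segment must start with an N-terminal Cys")
    return thioester_segment + cys_segment


# Residues transcribed from the listing (assumed normalization of OCR codes).
SEQ_13 = parse("""
    Pro Gln Ile Thr Leu Trp Lys Arg Pro Leu Val Thr Ile Arg Ile Gly
    Gly Gln Leu Lys Glu Ala Leu Leu Asp Thr Gly Ala Asp Asp Thr Val
    Ile Glu Glu Met Asn Leu Pro Gly
""")
SEQ_12 = parse("""
    Cys Trp Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val
    Arg Gln Tyr Asp Gln Ile Pro Val Glu Ile Xaa Gly His Lys Ala Ile
    Gly Thr Val Leu Val Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn
    Leu Leu Thr Gln Ile Gly Xaa Thr Leu Asn Phe
""")
SEQ_18 = parse("""
    Ala Gln Val Ile Asn Thr Phe Asp Gly Val Ala Asp Tyr Leu Gln Thr
    Tyr His Lys Leu Pro Asn Asp Tyr Ile Thr Lys Ser Glu Ala Gln Ala
    Leu Gly Trp Val Ala Ser Lys Gly Asn Leu Ala Asp Val Ala Pro Gly
""")
SEQ_17 = parse("""
    Cys Ser Ile Gly Gly Asp Ile Phe Ser Asn Arg Glu Gly Lys Leu Pro
    Gly Lys Ser Gly Arg Thr Trp Arg Glu Ala Asp Ile Asn Tyr Thr Ser
    Gly Phe Arg Asn Ser Asp Arg Ile Leu Tyr Ser Ser Asp Trp Leu Ile
    Tyr Lys Thr Thr Asp His Tyr Gln Thr Phe Thr Lys Ile Arg
""")

product_15 = ligate(SEQ_13, SEQ_12)  # compare with SEQ ID NO:15
product_20 = ligate(SEQ_18, SEQ_17)  # compare with SEQ ID NO:20

print(len(product_15), len(product_20))  # 99 110
# 1-based positions of Xaa in the first product
print([i + 1 for i, r in enumerate(product_15) if r == "Xaa"])  # [67, 95]
```

The product lengths (99 and 110) and the Xaa positions (67 and 95) agree with the LENGTH and LOCATION fields given for SEQ ID NO:15 and SEQ ID NO:20.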
What is claimed is:

1. A method for ligating a first oligopeptide with a
second oligopeptide end to end for producing an
oligopeptide product, the method comprising the following
steps:

Step A: admixing the first and second oligopeptides in
a reaction solution including a catalytic thiol, the
first oligopeptide including a C-terminal thioester, the
second oligopeptide including an N-terminal cysteine
having an unoxidized sulfhydryl side chain; then

Step B: condensing the unoxidized sulfhydryl side
chain of the N-terminal cysteine with the C-terminal
thioester for producing an intermediate oligopeptide
linking the first and second oligopeptides with a
β-aminothioester bond; and then

Step C: rearranging the β-aminothioester bond of the
intermediate oligopeptide of said Step B for producing
the oligopeptide product linking the first and second
oligopeptides with an amide bond.

2. A method as described in Claim 1 wherein, in said
step A, the catalytic thiol is selected from the group
consisting of unconjugated mercaptans and conjugated
thiols.

3. A method as described in Claim 2 wherein, in said 
step A, the catalytic thiol is benzyl mercaptan. 

4. A method as described in Claim 2 wherein, in said
step A, the catalytic thiol is a conjugated thiol
selected from the group consisting of thiophenol,
1-thio-2-nitrophenol, 2-thio-benzoic acid, 2-thio-pyridine,
4-thio-2-pyridinecarboxylic acid, and
4-thio-2-nitropyridine.

5. A method as described in Claim 4 wherein, in said
step A, the conjugated thiol is thiophenol.

6. An oligopeptide intermediate comprising:

a first oligopeptide segment having a C-terminal
thioester,

a second oligopeptide segment having an N-terminal
cysteine, and

a β-aminothioester linkage unit linking the C-terminal
thioester and the N-terminal cysteine, said
β-aminothioester linkage unit spontaneously rearranging
intramolecularly to form an amide bond linking said first
and second oligopeptide segments end to end.



WO 96/34878 PCT/US95/05668 


7. A method for producing an oligopeptide having a
C-terminal thioester, the method comprising the following
steps:

Step A: providing a resin having a linker with an
unoxidized thiol;

Step B: providing a Boc-amino acid succinimide ester;
then

Step C: admixing the resin of said Step A and the
Boc-amino acid succinimide ester of said Step B under
reaction conditions for producing a Boc-amino
thioester-resin; then

Step D: assembling an oligopeptide onto the Boc-amino
thioester-resin by stepwise solid phase peptide
synthesis; then

Step E: cleaving the Boc-amino thioester-resin of said
Step D with HF for producing an oligopeptide having a
C-terminal thiol; and then

Step F: converting the oligopeptide having a C-terminal
thiol of said Step E to the oligopeptide having a
C-terminal thioester.